ENH: Implemented MultiIndex.searchsorted method ( GH14833) #61435

GSAUC3 · 2025-05-12T17:35:52Z

closes Multiarray searchsorted fails #14833
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

datapythonista · 2025-05-12T17:46:56Z

Thanks @GSAUC3 for the PR. Is there and issue, or has there been any discussion about this elsewhere?

GSAUC3 · 2025-05-12T17:51:30Z

Hi @datapythonista, this was the issue #14833 , against which i made the pull request.

GSAUC3 · 2025-05-12T19:14:57Z

HI, I had ran pytest and pre-commit locally, is it possible to run all these test locally?

RB-zentronlabs · 2025-05-13T07:40:28Z

@GSAUC3 Add a return statement in your function after except block.

check teh docstring and proper typing hints, IndexOpsMixin check this class, where searchsorted has a properly defined parameter structure.

…er MultiIndex()

GSAUC3 · 2025-05-15T00:04:09Z

hi, @datapythonista , I am having trouble running all these tests, locally, before committing my code, so far i have only ran the pytest, locally, and that worked, could you please, guide me, how to set up the testing environment locally, before each commit?

will running
pre-commit run --all-files
suffice?

datapythonista · 2025-05-15T08:20:52Z

pre-commit should run automatically if it's set up to work as intended. You have all the information on how to set up the development environment, run tests... in the development documentation: https://pandas.pydata.org/docs/development/index.html

RB-zentronlabs · 2025-05-15T08:27:28Z

Hi, @datapythonista, this part of the error messages tells, us that searchsorted method should fail, but it is passing, am i correct?

=================================== FAILURES ===================================
__________________________ test_searchsorted[tuples] ___________________________
[gw0] darwin -- Python 3.10.17 /Users/runner/micromamba/envs/test/bin/python3.10
[XPASS(strict)] np.searchsorted doesn't work on pd.MultiIndex: GH 1[48](https://github.com/pandas-dev/pandas/actions/runs/15033468287/job/42250710830?pr=61435#step:5:52)33
___________________ test_searchsorted[mi-with-dt64tz-level] ____________________
[gw0] darwin -- Python 3.10.17 /Users/runner/micromamba/envs/test/bin/python3.10
[XPASS(strict)] np.searchsorted doesn't work on pd.MultiIndex: GH 14833
___________________________ test_searchsorted[multi] ___________________________

datapythonista · 2025-05-15T08:36:08Z

Hi, @datapythonista, this part of the error messages tells, us that searchsorted method should fail, but it is passing, am i correct?

Yes, that's correct. I guess we have an xfail for the test that should be removed.

GSAUC3 · 2025-05-17T17:53:03Z

Hi @datapythonista . Thank you for your suggestions, I've addressed the feedback from earlier and the CI checks are now passing. This PR should be ready for review whenever you get a chance. Please let me know if any changes are required. Thanks again!

datapythonista

Looks good, added few comments.

You will have to add a not in the whatsnew for 3.0. You can check other PRs to see how we do it.

datapythonista · 2025-05-19T13:16:16Z

pandas/tests/base/test_misc.py

+    # if isinstance(obj, pd.MultiIndex):
+    #     # See gh-14833
+    #     request.applymarker(
+    #         pytest.mark.xfail(
+    #             reason="np.searchsorted doesn't work on pd.MultiIndex: GH 14833"
+    #         )
+    #     )
+    if obj.dtype.kind == "c" and isinstance(obj, Index):


Can you remove this instead of commenting please?

Absolutely, I will do it right away.

datapythonista · 2025-05-19T13:16:50Z

.gitignore

@@ -141,3 +141,5 @@ doc/source/savefig/
 # Pyodide/WASM related files #
 ##############################
 /.pyodide-xbuildenv-*
+
+*.ipynb


Better to remove this. We do have some notebooks in this repo iirc.

Yes, I will remove it.

datapythonista · 2025-05-19T13:25:06Z

pandas/tests/indexes/multi/test_indexing.py

@@ -1029,3 +1029,28 @@ def test_get_loc_namedtuple_behaves_like_tuple():
        assert idx.get_loc(("i1", "i2")) == 0
        assert idx.get_loc(("i3", "i4")) == 1
        assert idx.get_loc(("i5", "i6")) == 2
+
+
+def test_searchsorted():


I think it would be better to divide this test. Ideally, when a test fails, we want to know what's wrong just by checking the test name, and also we want to still test everything else. With a single test, the first thing that fails will make the whole test fail. You can use pytest fixtures and parametrize to not repeat too much code.

Thank you for the suggestion, it is quite insightful for me. I have created three different functions as follows:

def test_searchsorted_single():... # for single inputs def test_searchsorted_lists():... # list of single inputs def test_searchsorted_invalid():... # for invalid inputs

Would this be the correct way to do it?

Yes, that may seem silly, but then when a test fails it's very easy to know what it fails. Ideally an assert per test, and if more than one input is asserted, then pytest.mark.parametrize to reuse a test with multiple inputs.

GSAUC3 added 4 commits May 12, 2025 22:48

sortedarray in side multi.py implemented, testing pending

cffb863

implemented the searchsorted() method, w.r.t issue pandas-dev#18433

1ba7ff8

modified test_searchsorted, discarded the use of numpy.testing

ac70f3e

applying pre-commit fixes

275b0e2

datapythonista added Enhancement MultiIndex labels May 12, 2025

GSAUC3 added 3 commits May 14, 2025 00:27

fixed the returned statement error

0e0b9b5

solved the mypy type checking error, and implemented searchsorted und…

4747609

…er MultiIndex()

Merge branch 'main' into searchsorted_branch

9ac62ab

GSAUC3 added 4 commits May 17, 2025 10:02

Closes the GH 14833 issue

e2c2c5e

pre-commit run successful, closes issue 14833

e88da57

fixed incompatible return type issue, and closes issue 14833

94f7c44

fixed typing checks; closes issue 14833

1f4a1c9

GSAUC3 changed the title ~~Searchsorted branch~~ Implemented MultiIndex.searchsorted method (bug #14833) May 17, 2025

GSAUC3 changed the title ~~Implemented MultiIndex.searchsorted method (bug #14833)~~ ENH: Implemented MultiIndex.searchsorted method ( GH14833) May 17, 2025

datapythonista reviewed May 19, 2025

View reviewed changes

updated whatsnew list, added more descriptive tests

ffd99d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Implemented MultiIndex.searchsorted method ( GH14833) #61435

ENH: Implemented MultiIndex.searchsorted method ( GH14833) #61435

GSAUC3 commented May 12, 2025 •

edited

Loading

datapythonista commented May 12, 2025

GSAUC3 commented May 12, 2025 •

edited

Loading

GSAUC3 commented May 12, 2025

RB-zentronlabs commented May 13, 2025

GSAUC3 commented May 15, 2025 •

edited

Loading

datapythonista commented May 15, 2025

RB-zentronlabs commented May 15, 2025 •

edited

Loading

datapythonista commented May 15, 2025

GSAUC3 commented May 17, 2025

datapythonista left a comment

datapythonista May 19, 2025

GSAUC3 May 19, 2025

datapythonista May 19, 2025

GSAUC3 May 19, 2025

datapythonista May 19, 2025

GSAUC3 May 19, 2025 •

edited

Loading

datapythonista May 19, 2025

ENH: Implemented MultiIndex.searchsorted method ( GH14833) #61435

Are you sure you want to change the base?

ENH: Implemented MultiIndex.searchsorted method ( GH14833) #61435

Conversation

GSAUC3 commented May 12, 2025 • edited Loading

datapythonista commented May 12, 2025

GSAUC3 commented May 12, 2025 • edited Loading

GSAUC3 commented May 12, 2025

RB-zentronlabs commented May 13, 2025

GSAUC3 commented May 15, 2025 • edited Loading

datapythonista commented May 15, 2025

RB-zentronlabs commented May 15, 2025 • edited Loading

datapythonista commented May 15, 2025

GSAUC3 commented May 17, 2025

datapythonista left a comment

Choose a reason for hiding this comment

datapythonista May 19, 2025

Choose a reason for hiding this comment

GSAUC3 May 19, 2025

Choose a reason for hiding this comment

datapythonista May 19, 2025

Choose a reason for hiding this comment

GSAUC3 May 19, 2025

Choose a reason for hiding this comment

datapythonista May 19, 2025

Choose a reason for hiding this comment

GSAUC3 May 19, 2025 • edited Loading

Choose a reason for hiding this comment

datapythonista May 19, 2025

Choose a reason for hiding this comment

GSAUC3 commented May 12, 2025 •

edited

Loading

GSAUC3 commented May 12, 2025 •

edited

Loading

GSAUC3 commented May 15, 2025 •

edited

Loading

RB-zentronlabs commented May 15, 2025 •

edited

Loading

GSAUC3 May 19, 2025 •

edited

Loading